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Abstract 

We conduct a review to assess how the simulation of repeated or recurrent events are planned. For 
such multivariate time-to-events, it is well established that the underlying mechanism is likely to be 
complex and to involve in particular both heterogeneity in the population and event-dependence. In this 
respect, we particularly focused on these two dimensions of events dynamic when mimicking actual data. 
Next, we investigate whether the processes generated in the simulation studies have similar properties to 
those expected in the clinical data of interest. Finally we describe a simulation scheme for generating 
data according to the time-scale of choice (gap time/ calendar) and to whether heterogeneity and/or event- 
dependence are to be considered. The main finding is that event-dependence is less widely considered in 
simulation studies than heterogeneity. This is unfortunate since the occurrence of an event may alter the 
risk of occurrence of new events. 


1 Introduction 

Simulation studies that are performed for assessing statistical methods used in recurrent events analysis often 
rely on standardized event generation processes that cannot take into account some features of the data |T|. 
Indeed, it is well recognized in the literature on recurrent events that in most medical research applications, 
the underlying process is likely to be complex, involving both heterogeneity in the population, and event- 
dependence |f21. Heterogeneity arises if some subjects are more prone than others to experience events, owing 
to unkown or unmeasured factors, e.g. associated with lifestyle or genetic predisposition. This leads to intra¬ 
subject correlations between the waiting times separating consecutive events. Event-dependence refers to the 
case when the occurrence of one event may render the occurrence of subsequent ones more or less likely, e.g. 
as a result of a learning or immunity process, of a weakening of the body after an event or of an intervention 
subsequent to each event. The aim of this paper is to review the simulation methods that are used in the 
literature, focusing on those which generate data according to the features of actual data that are encountered 
in practice, so that they could be recommended for wider use. While, in many articles, heterogeneity is 
included in the model generating the simulated data, such is not the case of event-dependence, as will be 
seen. This work aims to describe the useful generating methods in detail and to underline the relevance of 
each of them for considering data of different types. In section |2l the literature is reviewed in order to assess 
how recurrent events processes are generated, and how they match the events dynamic of the biomedical 
data motivating these articles. The simulations performed are compared with the clinical studies considered. 
Section [3] describes methods for generating data according to the scenario of interest. A discussion follows 
in which the benefits of the methods are compared and some guidelines are suggested. 
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2 Results of the literature search 


2.1 Introduction to the intensity models used 

The simulation studies included in the following review mostly rely on modeling intensity or include pro¬ 
cesses that can be written as intensity-based processes. We focus on multiplicative proportional hazards 
intensities as they are the most frequent in the literature. The intensity process may be defined in one of two 
common timescales: the calendar-time and the gap-time. In calendar-time, the time denotes the time elapsed 
since the origin. In gap-time, the time is reset to zero when an event occurs, so that the time corresponds to 
the time elapsed since the previous event lUl. 

In calendar-time, for a vector of external covariates X, the common multiplicative proportional intensity 
A(.) can be written: 

A(f|X = x) = Ao(f) exp(/?'x). (1) 

In gap-time, the common multiplicative proportional intensity becomes: 

X{t\X = x) = ho{t - Tjvp-)) exp(^'x). (2) 

The models are described in more detail in section [3] 

2.2 How recurrent events processes are generated in the literature 

We conducted a literature research in six journals: Biometrics, Biometrika, Computational Statistics and 
Data Analysis, Journal of the Americal Statistical Association, Lifetime Data Analysis and Statistics in 
Medicine. This search was performed using the journals’ webpages on October 10*^, 2011, focusing on 
articles published between 1996 and this date. In total, we found 135 articles that featured the term ’recurrent 
events’, ’repeated events’ or ’repeated failures’, in the title and/or in the key words. Among these, 102 
featured simulation studies of recurrent events. 

The generating intensity processes could be classified according fo timescale, considerafion of hefero- 
geneify, and consideration of evenf-dependence. In fhe simple case where fhe baseline intensify is consfanf, 
fhe process can be expressed equivalenfly in eifher timescale (see Section [3]l, so Tables [U and [2] include a 
cafegory ’’consfanf baseline” in addifion fo fhe cafegories ’’gap-fime” and ”calendar-fime”. A fofal of 92 
arficles could be classified in fhis way. In fhe sequel, fhe focus will be on fhese 92 articles. The 10 remaining 
arficles were nol classified, for example because fhe focus was nol on fime of evenfs buf only on fhe number 
of evenfs over given infervals. 

The resulfs are summarized in Table [Jin which simulation sfudies are classified according fo timescale 
and fype of populafion (homogeneous or heferegeneous), and in Table|2l in which simulafion sfudies are clas¬ 
sified according fo timescale and whefher or nol fhe generation processes included evenf-dependence. Note 
lhal one arlicle can appear in more lhan one cell in each of fhese fables if several scenarios are considered in 
fhe simulafion sludy. 

Among fhe 92 arficles, 54 (59%) fealured exlusively processes wifh consfanf baseline. Note lhal a con- 
slanl baseline does nol exclude lime-dependenl covariales. Only 13 (14%) considered processes wifh a 
baseline lhal was a funcfion of lime since fhe previous evenl (gap-lime), and 28 (30%) fealured processes 
wifh a non-consfanl calendar baseline. 

Among fhe arficles wifh a baseline lhal varied wifh lime, 3 arficles S |5l |6l also considered a limescale 
lhal was in-belween fhe gap-fime and fhe calendar-lime: some evenfs caused a ’’perfecl repair” lhal led fhe 
baseline fo reslarl al 0, while for olher evenfs fhe baseline continued as in a calendar timescale. 

Helerogeneily among subjecls was widely considered: 80 arficles (87%) included in fhe generation pro¬ 
cess a subjecl-specific random effecl lhal was usually gamma-disfribufed. 
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Event-dependence was more rarely considered : only 14 (15%) of the simulation studies feature this 
scenario. It was modeled through an internal covariate, an event-specific baseline or an event-specific random 
effect. 

In addition, Table [T] suggests that the situation of a heterogeneous population is seldom considered when 
the baseline hazard is time-varying and formulated in gap-time (in 7 articles among the 13 featuring processes 
generated according to this timescale). Table |2] suggests that the situation of event-dependence is less widely 
considered in simulation studies using a time-varying baseline hazard in a calendar timescale (in 2 articles 
ISIITI among the 28 using this timescale). 

The list of papers included in this review are available as supplementary material. 

2.3 Comparison of generating process and data of interest in two examples 

In most of the simulation studies considered, the statistical methods investigated were also used in the anal¬ 
ysis of biomedical data. In this section, we focus on two examples of applications that are commonly 
encountered in the recurrent events analysis setting, and investigate whether the processes generated in the 
simulation study have similar properties to those expected in the data of interest. 

2.3.1 Recurrent events associated with asthma 

Asthma is the disease of interest in several studies involving recurrent events. Three articles in our literature 
search analyze data from asthma studies |l8l|9l[TQ. The events of interest are not always the same. The focus 
was on hospitalisations or physicians’ visits attributable to asthma in the study by Cai and Schaubel |81, on 
exacerbations in the study by Chen and Cook f9), and on coughing episodes in the study by Cook et g/ llTOl . 
However, these different types of events are expected to reflect the general evolution of the disease. 

Asthma is expected to be a stable disease that does not evolve much with time, at least in adults ifTTl . 
which is the population of interest in two of the studies mentioned lUITOl. However, in young children, 
Cai and Schaubel l|3 found that the risk of event was increased strongly between 0 and 2 years old and was 
decreased after 3 years old. Heterogeneity is to be expected as it is well-known that there are various degrees 
of severity of asthma ifT^ . The fact that asthma is considered as a stable disease suggests that at least in 
adults there is no event-dependence and no time-dependence. 

The three simulation studies considered H] |9l [TOl all relied on simulated processes that had a constant 
baseline and featured no event-dependence but were heterogeneous across the sample. This matches the 
expected events dynamic of the data processes, except maybe for the asthma study by Cai and Schaubel 
which focuses on children, for whom time-dependence is to be expected. 

2.3.2 Recurrent infections in patients with chronic granulomatous disease 

Among the articles considered in the review of simulation studies, seven E |T3l [lH [15] [161 HZl HU in¬ 
vestigated an application to a dataset from a clinical trial that was originally introduced by Flemming and 
Harrington ll^ . which studies the effect of gamma interferon in patients with chronic granulomatous dis¬ 
ease (CGD). This double-blind trial focused on the occurrence of recurrent infections among 128 patients 
randomised between placebo and gamma interferon. 

The expected clinical course of CGD is not well-known since it is a relatively rare condition (about one 
in 250000 individuals) ll20l . It is therefore not obvious whether one should expect time-varying hazards, but 
in the absence of evidence this possibility should not be discarded. Moreover, one study pointed out that 
the incidence for some particular types of infections increases during follow-up. Similarly, event-dependence 
has received little attention, but the analysis of the dataset of the gamma interferon trial by Box-Steffensmeier 
and De Boef |[2l provides an indication of event dependence, specifically that the occurrence of an infection 
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makes recurrence more likely. Additionally, it is well-known that there is much clinical heterogeneity among 
patients with CGD |[22ll2^ . 

Among the seven simulation studies presented in articles featuring an application to the CGD data, 
two considered only homogeneous processes ifTSlfT^ . A time-varying baseline in a calendar timescale was 
considered in four |[T5llT4l[T7llT8l of the seven studies, which is more than expected given the total proportion 
found in the review (30%). Event-dependence, however, was considered only in one simulation study |5j|- 

3 Simulations based on intensity-based models 

The simulation studies included in the review in Section |2] were based on the modeling intensity or could 
be written as intensity-based processes. This intensity could be additive or more commonly multiplicative. 
Accelerated failure time models, which correspond to non-proportional intensities, are seldom used in this 
setting. In this section, we present examples of intensity processes for each scenario defined in Section |2] 
We focus on multiplicative proportional hazards intensities as they are the most frequent in the literature. 

For a given suject, let Tj denote the time elapsed from the origin to the event, Tq = 0, and let 
Wj = Tj — Tj_i denote the waiting time between two consecutive events. The Wj are referred to as gap 
times. 

Whichever intensity process is considered, a first step is to generate a censoring time C, and external 
covariates X if necessary. Then, the event times Tj (or the gap times Wj) are generated until the censoring 
time is reached. The censoring time is usually generated according to an exponential or uniform distribution. 

3.1 Standard model in each timescale 

If the baseline is constant, the recurrent event process can be considered equivalently as an homogeneous 
Poisson process or a sequence of exponentially distributed gaps, which makes the generation process straight¬ 
forward. A constant baseline is the most widely considered scenario. However, it may be beneficial fo con¬ 
sider different generation processes that involve time-varying baselines using the two possible timescales. 

3.1.1 Calendar timescale 

When the relevant timescale is the time elapsed since inclusion, the baseline is specified in a calendar 
timescale. In particular, calendar-time is the proper timescale in the case of a disease that evolves in the 
long run. For example, the calendar timescale is used when analyzing exacerbations in patients with chronic 
bronchitis, which often arise as a consequence of impaired lung function due to the underlying disease ll24l 
p233]. 

The standard model is a non-homogeneous Poisson process, with an intensity given by ([Hi- 
The most common choice is a Weibull intensity, with Ao(f) = where A is a scale parameter and v 

a shape parameter. The Weibull distribution allows the baseline to increase (if > 1) or decrease (if u < 1) 
over time. 

There are several ways to simulate a non-homogeneous Poisson process. 

The first is the inversion method. The gap time Wj of a non-homogeneous Poisson process has a 
cumulative distribution function Fj{.) equal to: 


Fj{w) = 1 - P{Wj > w) 

= 1 - P{N{Tj_^ +w)- N{Tj_^) = 0}. (3) 

The number of events observed for a non-homogeneous Poisson process with intensity A(.) between two 
times s and t follows a Poisson distribution with parameter f* \{u)du, which leads to: 
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N{Tj_i + w)- N{Tj_i) ~ r 


a 


■Tj-i+w 



( 4 ) 


And (O becomes : 



1 — exp 



(5) 


The successive event times are then simulated according to the following algorithm: 

1. Let Tq = 0. 

2. For the event, simulate Vj ~ Zi[0,1]. 

3. Let Wj = F-\Vj) and Tj = T,_i + Wj. 

4. If Ti < C, return to step 2 for the next event. Otherwise, the 7 '*^ event time is censored: set T, = C 
and Wj = C- Tj_i. 

In the Weibull intensity in model ([T]), the event times simulated are given by: 



Note that if Vj ~ U[Q, 1], then 1 — ~ ZY[0,1], and log(l — Vj) can be replaced by log(V^) in this 

expression. 

A limitation of this method is that it requires Fj{.) to be invertible. If not, numerical methods may be 
used to obtain Wj such that Fj{Wj) = Vj. 

Another common method is referred to as thinning |[25l . First, a value A is chosen such that Vt, \ {t) < A. 

If times Tf, T 2 *... are generated from a homogeneous Poisson process with rate A, the process in which each 

\{T*) \(T*) 

time T* has a probability —^ of being an actual event time, and the complementary probability 1-^ 

to be discarded, is a process with intensity A(.). The corresponding generating algorithm is as follows: 

1. Let Tq = 0,T* = 0, j = 1. 

2. Simulate E ~ Exp{X), and let j'* = j'* ^ jjj 

3. Simulate V ~ ZF[0,1]: if lA < ~ event time. Then set j = j + 1, and return to 

\{T *) 

step 2. If lA > -J , T* is not an event time, go back directly to step 2. 

The assumption that the intensity A(.) is an upper-bounded by a value A is not as restrictive as it might 
seem, since processes are usually generated over a finite time interval ll24l . This second method is often 
slower than the inversion method and is usually used when the inverse of the cumulative distribution func¬ 
tions Ej{.) is not easily calculated. 
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3.1.2 Gap timescale 


When the relevant timescale is the time elapsed since the previous event, the baseline is specified in gap¬ 
time. The gap-time is relevant for example when generating a risk of relapse after a complete response to 
treatment [51. The basic model is a renewal process, with an intensity given by (jj)). 

The Weibull distribution is again a natural choice, with ho{w) = allowing for a risk of event 

that increases or decreases starting from the occurrence of the previous event. 

The successive i.i.d. gap times can be simulated in the same way survival times are simulated when 
the hazard function is time-varying Il26l . The cumulative distribution function for the gap time Wj is : 
Fj{w) = 1 — exp{Tfo('M^) exp(/?'x)}. Since FjiWj) ~ Z^[0,1], ex.-p{—Ho{w) exp{l3'x)} ~ Z^[0,1]. The 


gap time Wj can therefore be simulated by generating Vj ~ ZY[0,1], and setting Wj = Hq 


-1 


exp(/3'tr) 


. In 


the case of the Weibull distribution, it leads to Wj = ( — 


A exp(/3'x) 


New gap times are simulated until the sum of the gap times exceeds the censoring time: the last gap time 
is then censored. 

When the cumulative hazard is not easily invertible, one may use numerical methods to solve HQ{Wj) exp(/3'x) 
-log(V,). 

Again, an alternative method is to use an acceptance-rejection type algorithm ETl pl70] adapted from 
the thinning method proposed by Lewis and Shedler for non-homogeneous Poisson processes |[25l . 

Denote h an upper-bound of the hazard function h(.) over the period considered. Each gap time Wj is 
simulated according to the following algorithm: 


1. Set Wq = 0 and W* = 0. 

2. Generate E ~ Exp{h), and let W* = W* + E. 

3. Generate Vj ~ ZY[0,1]. Wj < \ W* is the simulated gap time Wj. Otherwise return to step 

2 . 


Again, new independent gap times are simulated until the sum of the gap times exceeds the censoring time: 
the last gap time is then censored. 


3.2 Heterogeneity 

Heterogeneity in the population is often realistic in biomedical studies and is widely considered in simulation 
studies designed to evaluate the properties of statistical methods. In both timescales, heterogeneity can be 
easily introduced by generating a subject-specific random effecf. This random effecf is usually assumed to 
follow gamma or lognormal distribution, but other distributions are possible. For instance, one may use a 
binary random effect to investigate model properties when binary covariates are omitted. 

In the calendar timescale, the resulting process is referred to as a mixed Poisson process, i.e. the individ¬ 
ual processes are Poisson processes with subject-specific intensities. The usual multiplicative proportional 
intensity can then be written : 


X{t\U = u,X = x) = Xo{t)uexp{/3'x), (6) 

where u is the subject-specific random effect. A simple simulation scheme consists in first generating 
independent individual frailties according to the chosen distribution. The individual intensities are obtained 
from ®. The individual processes can then be simulated as Poisson processes as described in Section [3. 1.1 1 
In gap-time, the result is a shared frailty model, with the corresponding intensity : 

X{t\U = u,X = x) = ho{t — Tjvp-))uexp(/3'x). 
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Similarly, independent individual frailties are first generated. The gap times for each subject can then be 
simulated as described in Section [3X2] 


3.3 Generating recurrent event with event-dependence 

Event-dependence is less widely considered in simulation studies than heterogeneity. However, for many 
recurrent events data, it is realistic to consider that the occurrence of an event may alter the risk of occurrence 
of new events. For this reason, it is useful to investigate the properties of statistical methods under event- 
dependence, e.g. when the occurrence of an event increases the risk of occurrence of subsequent ones. 

3.3.1 Event-dependence in gap-time 

It should be noted that a baseline that is a function of time in gap-time constitutes a form of event-dependence, 
since the occurrence of an event causes the hazard to reset back to its original level. However, additional 
event-dependence may be appropriate. A common assumption is to consider that the occurrence of an event 
increases the risk of further ones (e.g. by making subjects more frail and therefore prone to experiencing 
events), or on the contrary decreases it (by making subjects immune or more vigilant). A natural way to 
generate such data is to take an event-specific baseline or an infernal fime-dependenf covariafe fhaf induces 
dependence |[28ll . as did Cook et al ll29l in fheir simulation sfudies. 

A simple example is fo multiply fhe baseline by wifh j — 1 fhe number of previous evenfs. Setting 
a <1 will cause fhe risk of evenf fo decrease wifh fhe number of pasf evenfs while setting a > 1 will cause 
fhe risk fo increase wifh fhe number of evenfs. This is similar fo including fhe number of previous evenfs as 
an infernal covariafe in a Cox-fype model wifh associated paramefer ln(Q:). 

However, if fhe risk increases exponentially afler each even!, fhe even! process may ’’explode”, wifh a 
number of evenfs fhaf fends fo infinify for a finite time ll30l . A common solufion is fo consider fhe risk fo 
be sfabilised affer a pre-specified number of evenfs, e.g. 4 evenfs. Considering a Weibull baseline and a risk 
fhaf is sfable afler 4 evenfs, fhe dala generafion model wifhoul covariales becomes : 



Usually, when considering a gap-fimescale, fhe infernal covariafe fhaf induces fhe dependence belween 
evenfs is consfanl for each gap-lime and includes fhe number or timing of previous evenfs. The successive 
evenf times Wj can Ihen be simulated as described in Section 13.1.21 wifh a differenl cumulative dislribulion 
funclion for each evenf. 

However, more general processes can be considered, wifh infernal lime-dependenl covariales fhaf can 
be non-conslanf over fhe af-risk inferval for a given evenf. The melhods described in fhe nexl secfion for 
calendar-lime are general and valid for such processes. 

3.3.2 Event dependence in calendar-time: generating dynamic covariates 

Inducing event-dependence in a calendar timescale is usually done by considering an intensity with an inter¬ 
nal (or dynamic) covariafe that depends on the past of the process. An obvious choice is N{t~) to allow the 
intensity to increase or decrease with the number of past events. Such a generating process was considered 
by Gonzalez et al [5]. However, just as in gap-time, the risk is that the number of events will increase enor¬ 
mously when considering N{t~) as a covariafe. Other possible covariates are N{t~)/t, in order to simulate 
processes in which each event leads to an increased intensity, but where the effect of this increase fades with 
time, min{N (f“), ifo) to consider the case where the risk becomes stable after Kq events, or )-N{t-u) 
where u < t, in order to focus on recent history (on a period of length u) as suggested by Elgmati et al liXI . 
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The inversion method described in Section [3.1.1l for non-homogeneous Poisson processes can be used 
for more general processes, with an intensity that depends on the past. Indeed, Cook and Lawless |[24l p30] 
showed that: 


P{Wj > w\Tj_i 


= exp ( - / 

V 


tj-l+W 


X{u\H {u))du 


where H(t) is the history of the event process up to time t (not included). The intensity is therefore considered 
deterministic on the interval ]tj-i,tj-i + m]. This makes it possible to simulate a process defined by an 
intensity conditional on the past by the inversion method in the same way as a non-homogeneous Poisson 
process, by generating Fj{Wj) ~ ZY[0,1] with Fj{w) = F{Wj < w\Tj-i^H{Tj-i)). 

For example, the process with intensity: 


A(f) = Ao(f) exp(/3'x + )), 

has a cumulative distribution function for the gap time: 

ftj-l+W 


Fj (w) = 1 — exp — 


Jtj—i 


\o{u) exp{j3'x + (p{j - l))du . 


When the baseline intensity is Weibull, generating 1 — Fj{Wj) = ~ (^[0,1] leads to: 


Tj + \exp{l3'x + <p(j-1)) ' / 

When considering the covariate min{N(t~), Ko) with a Weibull baseline intensity, the event times be¬ 
come: 


j.. ^ (_ log(^j) _ 

^ ' Xeyip{/3'x + (p{min{j — l,Ko))) 


+ TU 


More generally, an intensity process: 


X{t) = ex.p{go{t) + I{N{t )>Q)gi{t-N{t )) + g 2 {N{t )) + ^'x), 

which is a function of the time in a calendar timescale, the time in gap-time and the number of past events, 
leads to a cumulative distribution function of the gap time : 


Fj{w) = exp j exp{po(fj-i + u) + I{j 


-r g 2 U 


FjiWj) (i[0,1] is then generated and Wj is obtained from O analytically or numerically. In the 
case of the internal covariates N{t~)/t or )-N{t-u) before, a closed-form expression for Wj 

cannot easily be obtained. 

Another possibility is to use the thinning method described in Section 13.1.11 which was introduced by 
Lewis and Shedler |[25l for non-homogeneous Poisson processes, but was later proved to be valid when the 
intensity depends on the past 1321. In the algorithm described in Section [3. 1.11 X{T*) can therefore depend 
on the past. 

A final possibility fhaf does nol require an upper bound for fhe intensify or a numeric inversion of fhe cu¬ 
mulative disfribufion funcfion is based on a discrefe approximafion. A similar mefhod was used for example 
by l33l. A partition of fhe time period is defined by intervals of lengfh At: [kAt, {k + 1) Af], k = 0,1, 2 ,.... 
The lengfh At is chosen fo be small, so fhaf one can assume fhaf a subjecf cannof have more fhan one even! in 











each interval [kAt, [k + 1) Ai]. The occurrence of an event during the interval [kAt, {k + 1) Af] is generated 
with an event probability X{kAt)At, which approximates the probability of an event occurrence correspond¬ 
ing to (IHl. The intensity is then updated for the next interval. For example, if At corresponds to a day, the 
occurrence or not of an event is simulated every day. According to this approximation, simulating a process 
with an intensity \o{t) exp(/3'x + where Ao(.) is expressed as a function of time in days leads to 

consider that the subject has an event on the k^^ day with a probability Ao(A:) exp(/?'x + )■ This 

method of simulating general event processes is simple, but can also be slow. 

4 Discussion 

This review complements the extensive simulation study conducted by Metcalfe and Thompson |T1. Their 
study exemplified the fact that no model is adequate for all settings, and that it is necessary to investigate 
several situations in simulation studies, including heterogeneity and event-dependence, in order to have 
satisfactory knowledge of the properties of a given statistical method. 

In this review, we show that heterogeneity has received much attention. By contrast, event-dependence 
is seldom considered in simulation studies, especially when the event-generation process is expressed in 
calendar-time. This is unfortunate since event-dependence is known to have a critical effect on statistical 
models, in terms of properties and interpretation of the covariates effects ||34l. One reason for neglecting this 
possibility may be that the generation of non-Poisson processes is not obvious. In this paper, the general rela¬ 
tion between the intensity process and the successive gap times is recalled and examples of event-dependent 
processes are provided, in both timescales. Additionally, as noted by Bender et al ll26l for univariate survival 
times, most simulation studies use only constant baseline hazards corresponding to exponential gap times. 
However, it was pointed out by Duchateau et al ifTTl that the choice of the timescale in the statistical model 
may considerably impact the results. This suggests that a given model may demonstrate different proper¬ 
ties in simulation studies according to the timescale of the generation process, and that it would be wise to 
consider generation processes with time-varying baselines in both timescales. 

Extensive simulation studies that investigate the properties of a given statistical method should there¬ 
fore include, in our opinion, datasets generated by using a mixed Poisson process, processes with constant 
baseline, Weibull calendar-time, and Weibull gap-time baseline, and processes with event-dependence e.g. 
with the number of previous events included as a covariate in the intensity generation process. Such studies 
would assist the applied statistician in choosing the appropriate method in a particular biomedical context. 
When a statistical method is suggested for a biomedical context, the scenario considered for the simulation 
studies should match the expected dynamics of the event-process in this context. If a long-term evolution of 
the disease cannot be ruled out, calendar-time-varying intensity processes should be included. This would 
be the case for exacerbations in patients with chronic brochitis ll2^ p233]. A baseline varying with time in 
gap-time should be considered in the case when an event is followed by an intervention that leads to a kind of 
renewal of the subject. This was considered in particular in the context of cancer relapse where a complete 
response was possible lO. Heterogeneity should in general be included since there is no biomedical context 
in which one can be sure that all risk factors are known and always measured. Event-dependence should be 
considered in contexts in which an event could alter the risk of subsequent events. Eor example, this was 
considered in a simulation study by Kvist et alf]\, in the context of affective disorder, in which an important 
theory is the sensitisation theory, i.e. that the occurrence of an event may increase the risk of subsequent 
events. 
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Table 1: Results of literature seareh : generated proeesses elassified aeeording to timeseale and type of 
population (homogeneous/heterogeneous) (eells are non-exelusive) 



Homogeneity 

Heterogeneity 

Total 

Constant baseline hazard 

28 (30%) 

62 (67%) 

70 (76%) 

Baseline hazard in gap-time 

8 (9%) 

7 (8%) 

13 (14%) 

Baseline hazard in calendar-time 

12 (13%) 

25 (27%) 

28 (30%) 

Total 

38 (41%) 

80 (87%) 

92 


Table 2: Results of literature seareh : generated proeesses elassified aeeording to timeseale and inelusion of 
event-dependenee (eells are non-exelusive) 



No event-dependence 

Event-dependence 

Total 

Constant baseline hazard 

64 (70%) 

13 (14%) 

70 (76%) 

Baseline hazard in gap-time 

12 (13%) 

3 (3%) 

13 (14%) 

Baseline hazard in calendar-time 

27 (30%) 

2 (2%) 

28 (30%) 

Total 

83 (90%) 

14 (15%) 

92 
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